Exploiting Rich Features for Detecting Hedges and their Scope
Abstract
This paper describes our system for detecting hedges and their scope in natural language text, built for our participation in the CoNLL-2010 shared task. We formalize the two tasks as sequence labeling problems and implement them with conditional random field (CRF) models. In the first task, we use a greedy forward procedure to select features for the classifier. These features include the part-of-speech tag, word form, lemma, and chunk tag of the tokens in the sentence. In the second task, our system exploits rich syntactic features drawn from dependency structures and phrase structures, which perform better than flat sequence features alone. Our system achieves the third-highest score on the biological data set for the first task and an F1 score of 0.5265 for the second task.
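The abstract names a greedy forward procedure for feature selection but does not spell it out. The following is a minimal Python sketch of one common form of that idea, not the authors' implementation: starting from an empty set, repeatedly add the single feature that most improves a score, and stop when no candidate helps. The function names, the feature labels, and the `toy_evaluate` scorer are hypothetical; in a real setup the scorer would train a CRF on the given feature set and return its development-set F1.

```python
from typing import Callable, FrozenSet, Iterable, List


def greedy_forward_selection(
    candidates: Iterable[str],
    evaluate: Callable[[FrozenSet[str]], float],
) -> List[str]:
    """Add one feature at a time, keeping the one that most improves the score."""
    selected: List[str] = []
    remaining = list(candidates)
    best_score = evaluate(frozenset())
    while remaining:
        # Score the current set extended by each remaining candidate.
        scored = [(evaluate(frozenset(selected + [f])), f) for f in remaining]
        top_score, top_feature = max(scored)
        if top_score <= best_score:
            break  # no remaining candidate improves the score
        selected.append(top_feature)
        remaining.remove(top_feature)
        best_score = top_score
    return selected


# Hypothetical gains, in whole F1 points, used only to make the sketch runnable.
TOY_GAIN = {"word": 30, "lemma": 20, "pos": 10, "chunk": 0}


def toy_evaluate(features: FrozenSet[str]) -> float:
    # Stand-in for "train a CRF with these features and return dev-set F1".
    return sum(TOY_GAIN[f] for f in features)


chosen = greedy_forward_selection(["pos", "word", "lemma", "chunk"], toy_evaluate)
print(chosen)
```

With the toy scorer, features are added in order of their gain, and the feature that adds nothing is left out. Real feature gains are not additive, which is why each round re-scores every remaining candidate against the current set.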
Related papers
A Cascade Method for Detecting Hedges and their Scope in Natural Language Text
Detecting hedges and their scope in natural language text is very important for information inference. In this paper, we present a system based on a cascade method for the CoNLL-2010 shared task. The system consists of two components: one for detecting hedges and another for detecting their scope. For detecting hedges, we build a cascade subsystem. Firstly, a conditional random field (CRF) ...
Exploiting Rich Syntactic Features for Hedge Detection and Scope Finding
Hedge detection and scope finding are increasingly important tasks in information extraction, especially in the biomedical natural language processing community. In this paper, a novel approach to detecting hedge cues and their scopes by sequence labeling is explored. It should be emphasized that syntactic dependencies are systematically exploited and effectively integrated by a large-scale featur...
Learning to Detect Hedges and their Scope Using CRF
Detecting speculative assertions is essential for distinguishing facts from uncertain information in biomedical text. This paper describes a system that detects hedge cues and their scope using a CRF model. The HCDic feature is presented to improve hedge cue detection performance on the BioScope corpus. The feature can make use of cross-domain resources.
Exploiting Multi-Features to Detect Hedges and their Scope in Biomedical Texts
In this paper, we present a machine learning approach that detects hedge cues and their scope in biomedical texts. Identifying hedged information in texts is a kind of semantic filtering, and it is important because it can separate speculative information from factual information. In order to deal with the semantic analysis problem, various evidential features are proposed and integrated...
Hedges and Boosters in Academic Writing: Native vs. Non-Native Research Articles in Applied Linguistics and Engineering
The expression of doubt and certainty is crucial in academic writing where the authors have to distinguish opinion from fact and evaluate their assertions in acceptable and persuasive ways. Hedges and boosters are two strategies used for this purpose. Despite their importance in academic writing, we know little about how they are used in different disciplines and genres and how foreign language...
Publication date: 2010